Intra-Disk Parallelism
Authors
Abstract
Power is a major problem in data centers, and a significant fraction of this power is consumed by the storage system. Server storage systems use a large number of disks to achieve high performance, which increases their power consumption. In this thesis, we explore an architecture that can significantly reduce the power consumed by the storage system via intra-disk parallelism, wherein disk drives can exploit parallelism in the I/O request stream. Intra-disk parallelism can facilitate replacing a large disk array with a smaller one, using the minimum number of disk drives needed to satisfy the capacity requirements. We present a historical retrospective of intra-disk parallelism and show how technology has changed over time, creating new demand for efficient designs. We also show that the design space of intra-disk parallelism is large and present a taxonomy to formulate specific implementations within this space. Using a set of commercial workloads, we conduct a bottleneck analysis to identify the key performance bottlenecks that arise when a storage array tuned for high performance is replaced with a single high-capacity disk drive. These are the bottlenecks that intra-disk parallelism would need to alleviate. We then explore a particular intra-disk parallelism approach, where a disk is equipped with multiple arm assemblies that can be independently controlled, and evaluate three disk drive designs that embody this form of parallelism. We show that, for these workloads, a single disk drive of sufficient capacity that exploits intra-disk parallelism can match, and even surpass, the performance of a storage array, while consuming significantly less power than the multi-disk configuration. We then evaluate the performance and power consumption of disk arrays composed of intra-disk parallel drives and compare them with conventional multi-disk configurations.
Intra-disk parallelism reduces the power consumed at the disk drives by
Similar resources
Controlling Disk Contention for Parallel Query Processing in Shared Disk Database Systems
Shared Disk database systems offer a high flexibility for parallel transaction and query processing. This is because each node can process any transaction, query or subquery because it has access to the entire database. Compared to Shared Nothing, this is particularly advantageous for scan queries for which the degree of intra-query parallelism as well as the scan processors themselves can dyna...
Analysis of Parallel Scan Processing in Shared Disk Database Systems
Shared Disk database systems offer a high flexibility for parallel transaction and query processing. This is because each node can process any transaction, query or subquery because it has access to the entire database. Compared to Shared Nothing database systems, this is particularly advantageous for scan queries for which the degree of intra-query parallelism as well as the scan processors th...
MPL-Adaptive Algorithms for Multisegmented Disk Caches
This paper investigates the performance of a multisegmented disk cache processing multiple file scans. Multisegmented caches are designed to support concurrent file scans, which is crucial for providing both intra-query and inter-query parallelism. A multisegmented cache attached to an individual disk usually consists of a constant number of segments. A traditional round-robin segment replacement p...
Object Placement in Parallel Object Oriented Databases
This thesis explores the issue of object placement in the context of parallel object oriented databases. The particular parallel hardware architecture targeted is the shared nothing architecture where each node has its own processor, memory and disk. The particular query parallelisation method employed is intra-operator parallelism where each node works on the portion of the data set which resi...
Dynamic Load Balancing in Parallel Database Systems
Parallel database systems have to support both inter-transaction and intra-transaction parallelism. Inter-transaction parallelism (multi-user mode) is required to achieve high throughput, in particular for OLTP transactions, and sufficient cost-effectiveness. Intra-transaction parallelism is a prerequisite for reducing the response time of complex and data-intensive transactions (queries)....